Variant genotyping with gap filling
نویسندگان
چکیده
Although recent developments in DNA sequencing have allowed for great leaps in both the quality and quantity of genome assembly projects, de novo assemblies still lack the efficiency and accuracy required for studying genetic variation of individuals. Thus, efficient and accurate methods for calling and genotyping genetic variants are fundamental to studying the genomes of individuals. We study the problem of genotyping insertion variants. We assume that the location of the insertion is given, and the task is to find the insertion sequence. Insertions are the hardest structural variant to genotype, because the insertion sequence must be assembled from the reads, whereas genotyping other structural variants only requires transformations of the reference genome. The current methods for constructing insertion variants are mostly linked to variation calling methods and are only able to construct small insertions. A sub-problem in genome assembly, the gap filling problem, provides techniques that are readily applicable to insertion genotyping. Gap filling takes the context and length of a missing sequence in a genome assembly and attempts to assemble the intervening sequence. In this paper we show how tools and methods for gap filling can be used to assemble insertion variants by modeling the problem of insertion genotyping as filling gaps in the reference genome. We further give a general read filtering scheme to make the method scalable to large data sets. Our results show that gap filling methods are competitive against insertion genotyping tools. We further show that read filtering improves performance of insertion genotyping especially for long insertions. Our experiments show that on long insertions the new proposed method is the most accurate one, whereas on short insertions it has comparable performance as compared against existing tools.
منابع مشابه
A common variant of endothelial nitric oxide synthase (Glu298Asp) is associated with collateral development in patients with chronic coronary occlusions
BACKGROUND Experimental studies support an important role for endothelial nitric oxide synthase (eNOS) in the regulation of angiogenesis. In humans, a common polymorphism exists in the eNOS gene that results in the conversion of glutamate to aspartate for codon 298. In vitro and in vivo studies have suggested a decreased NOS activity in patients with the Asp298 variant. We hypothesized that a g...
متن کاملA common African variant of human connexin 37 is associated with Caucasian primary ovarian insufficiency and has a deleterious effect in vitro
Folliculogenesis requires communication between granulosa cells and oocytes, mediated by connexin-based gap junctions. Connexin 37 (Cx37)-deficient female mice are infertile. The present study assessed Cx37 deficiency in patients with primary ovarian insufficiency (POI). A candidate gene study was performed in patients and controls from the National Genotyping Center (Evry, France) including 58...
متن کاملGenotyping of Infectious bronchitis viruses isolated from broiler chicken farms in Iran during 2015-2016
BACKGROUND: Avian infectious bronchitis is considered as an important viral disease worldwide. Genotyping based on the S1 subunit of spike protein gene of the causative agent, avian infectious bronchitis virus, can be used to classify IBV isolates. Objective: This survey was carried out to characterize the infectious bronchitis virus (IBV) genotypes circulating in Iran and determine their preva...
متن کاملLikelihood-Based Gene Annotations for Gap Filling and Quality Assessment in Genome-Scale Metabolic Models
Genome-scale metabolic models provide a powerful means to harness information from genomes to deepen biological insights. With exponentially increasing sequencing capacity, there is an enormous need for automated reconstruction techniques that can provide more accurate models in a short time frame. Current methods for automated metabolic network reconstruction rely on gene and reaction annotati...
متن کاملMATHEMATICAL MODELLING OF THE EFFECT OF FOAM DEGRADATION ON MOULD FILLING IN THE GREY IRON EPC PROCESS
In this investigation a new model was developed to calculate gas pressure at the melt/foam interface (Gap) resulting from foam degradation during mould filling in the Lost Foam Casting (LFC) process. Different aspects of the process, such as foam degradation, gas elimination, transient mass, heat transfer, and permeability of the refractory coating were incorporated into this model. A Computati...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره 12 شماره
صفحات -
تاریخ انتشار 2017